The use of mixed spatial resolutions in multi-view video coding is a promising approach for coding videos efficiently at\nlow bitrates. It can achieve a perceived quality, which is close to the view with the highest quality, according to the\nsuppression theory of binocular vision. The aim of the work reported in this paper is to develop a new multi-view video\ncoding technique suitable for low bitrate applications in terms of coding efficiency, computational and memory\ncomplexity, when coding videos, which contain either a single or multiple scenes. The paper proposes a new prediction\narchitecture that addresses deficiencies of prediction architectures for multi-view video coding based on H.264/AVC. The\nprediction architectures which are used in mixed spatial-resolution multi-view video coding (MSR-MVC) are afflicted with\nsignificant computational complexity and require significant memory size, with regards to coding time and to the\nminimum number of reference frames. The architecture proposed herein is based on a set of investigations,\nwhich explore the effect of different inter-view prediction directions on the coding efficiency of multi-view video\ncoding, conduct a comparative study of different decimation and interpolation methods, in addition to analyzing\nblock matching statistics. The proposed prediction architecture has been integrated with an adaptive reference\nframe ordering algorithm, to provide an efficient coding solution for multi-view videos with hard scene changes.\nThe paper includes a comparative performance assessment of the proposed architecture against an extended\narchitecture based on the 3D digital multimedia broadcast (3D-DMB) and the Hierarchical B-Picture (HBP)\narchitecture, which are two most widely used architectures for MSR-MVC. The assessment experiments show that\nthe proposed architecture needs less bitrate by on average 13.1 Kbps, less coding time by 14% and less memory\nconsumption by 31.6%, compared to a corresponding codec, which deploys the extended 3D-DMB architecture\nwhen coding single-scene videos. Furthermore, the codec, which deploys the proposed architecture, accelerates\ncoding by on average 57% and requires 52% less memory, compared to a corresponding codec, which uses the\nHBP architecture. On the other hand, multi-view video coding which uses the proposed architecture needs more\nbitrate by on average 24.9 Kbps compared to a corresponding codec that uses the HBP architecture. For coding a\nmulti-view video which has hard scene changes, the proposed architecture yields less bitrate (by on average 28.7\nto 35.4 Kbps), and accelerates coding time (by on average 64 and 33%), compared to the HBP and extended 3DDMB\narchitectures, respectively. The proposed architecture will thus be most beneficial in low bitrate applications,\nwhich require multi-view video coding for video content depicting hard scene changes.
Loading....